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ABSTRACT 

Many  iterative  algorithms  for  the  solution  of  large  linear  systems  may  be  effectively 
vectorized  only  if  the  diagonal  of  the  matrix  is  surrounded  by  a  large  band  of  zeroes,  which 
is  called  the  zero  stretch.  In  this  paper,  a  multicolor  numbering  technique  is  suggested  for 
maximizing  the  zero  stretch  of  irregularly  sparse  matrices.  The  technique,  which  is  a  gen¬ 
eralization  of  a  known  algorithm  for  regularly  sparse  matrices,  executes  in  linear  time,  and 

•  ">  S*  i  "»  x 

produces  a  zero  stretch  approximately  equal  to  n  /2<f.  where  <T  is  the  number  of  colors  used 
in  the  algorithm.  For  triangular  meshes,  it  is  shown  that  or  ^3.  and' that  it  is  possible  to 
obtain  <jr=2  by  applying  a  simple  backtracking  scheme. 


This  work  is,  in  part,  supported  under  ONR  contract  N00014-85-K-0330.  and  the  Air  Force  contract 
AFOSR  84-0131. 
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1.  INTRODUCTION 

Iterative  solution  techniques,  in  general,  and  preconditioned  conjugate  gradient 
methods  (PCCG),  in  particular,  are  becoming  more  and  more  attractive  for  the  solution  of 
sparse  linear  systems.  Namely  because  these  methods  preserve  sparsity,  and  thus  allow  for 
the  solution  of  large  systems  without  resorting  to  auxiliary  (off  core)  storage.  This  is  a  big 
advantage  over  direct  solution  methods  that  destroy  sparsity.  Moreover,  the  availability  of 
supercomputers  contributed  to  the  popularity  of  the  iterative  techniques  by  providing  the 
computational  power  necessary  for  the  solution  of  huge  linear  systems,  and  yet  not  provid¬ 
ing  enough  memory  for  storing  the  systems  in  their  dense  (or  banded)  forms. 

However,  the  power  of  supercomputers  may  be  fully  exploited  only  for  computations 
which  are  vectorizable.  Unfortunately,  one  of  the  best  known  iterative  techniques  for  the 
solution  of  symmetric,  positive  definite,  linear  systems,  namely  the  incomplete 
factorization/PCCG  method  [11.  13],  requires  the  solutions  of  triangular  linear  systems  of 
equations.  The  algorithms  for  the  solution  of  such  systems  are  highly  recursive,  and  thus 
poorly  vectorizable.  More  specifically,  for  the  solution  of  a  linear  system  Ax  -  b  ,  each 
PCCG  iteration  requires  the  solution  of  two  triangular  systems  of  the  forms  (Z.  +D  )x  -b 
and  (L  +DY  x  =b  ,  where  L  is  a  strictly  lower  triangular  matrix  and  D  is  a  diagonal 
matrix.  Both  L  and  D  are  derived  from  A  ,  and,  for  irregularly  sparse  systems,  the  spar¬ 
sity  structure  of  L  +D  +L  r  is  identical  to  the  sparsity  structure  of  A  . 

The  recursive  solution  algorithms  for  (Z  +D  )x  =6  and  ( L  +D  Y  x  =b  are  only  vector¬ 
izable  to  the  extent  permitted  by  the  structure  of  L  .  More  specifically,  these  algorithms 
may  be  written  in  terms  of  vectors  of  length  A,  where  A  is  the  size  of  the  maximum  band 
which  contains  zeroes  below  the  diagonal  of  Z  (for  more  details  see  [15]).  A  more  precise 
definition  of  A  may  be  given  as  follows 

Definition:  The  zero  stretch  of  an  n  Xn  strictly  lower  triangular  matrix  Z  is  the  largest 

integer  A  such  that  a,  t  =  0  for  i  =  1 . n  and  i  <  j  ^  i—  A.  In  this  paper,  we  assume 

that  L  +Z)  +Z, T  has  the  same  sparsity  structure  as  A  ,  and  we  define  the  zero  stretch  of  A 
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to  be  equal  to  the  zero  stretch  of  L  . 

Hence,  for  the  efficient  solution  of  (Z.  +D  )x  —b  and  (Z  +Z)  Y  x  =6  on  vector  super¬ 
computers.  it  is  essential  to  rearrange  the  rows  and  columns  of  the  n  Xn  symmetric  matrix 
A  with  the  goal  of  maximizing  the  zero  stretch.  In  order  to  achieve  this  goal,  we  consider 
the  graph  GA  corresponding  to  A  .  More  precisely.  G4  contains  n  nodes  which  are  num¬ 
bered  by  the  integers  1.  ■  .n  such  that  the  node  numbered  i  corresponds  to  row/column  i 

in  A  .  and  G.*  contains  an  edge  from  node  i  to  node  j  if  and  only  if  the  (t  ,)  ),h  element  of 
A  is  non-zero.  Now.  interchanging  rows/columns  i  and  j  in  A  is  equivalent  to  inter¬ 
changing  the  numbers  of  the  two  corresponding  nodes  in  GA  .  In  other  words,  maximization 
of  the  zero  stretch  of  A  may  be  obtained  by  renumbering  the  nodes  of  GA  .  That  is  by 
assigning  to  each  node  i.  a  unique  number  v(i  ),  KKi)^n.  such  that  the 

difference  between  v(i  )  and  v{j )  for  any  two  neighboring  nodes  i  and  j  in  GA  is  as  large 
as  possible. 

Clearly,  the  problem  of  maximizing  the  zero  stretch  of  a  matrix  is.  in  some  sense,  the 
reciprocal  of  the  problem  of  minimizing  the  bandwidth  of  the  matrix  [18],  The  latter  prob¬ 
lem  is  one  of  minimizing  the  difference  I  v(i  )—v(j  )  I .  and  is  important  for  direct  solutions 
of  linear  systems.  Both  problems  have  been  proven  to  be  \P-hard  (see  [5.  10,  16]).  How¬ 
ever.  many  heuristics  have  been  studied  for  bandwidth  minimization  (see  for  examples 
(4.  6.  12.  21]).  while,  to  our  knowledge,  the  problem  of  zero  stretch  maximization  have  been 
studied  only  in  the  context  of  matrices  resulting  from  the  discretization  of  partial 
differential  equations  (PDE)  on  regular  grids  (see  for  example  [1.7,8.  14.  17.  19]). 

If  a  matrix  A  results  from  the  discretization  of  a  PDE  on  a  regular  grid,  then  its 
corresponding  graph.  GA  .  has  a  repetitive  structure  which  allows  for  a  simple  numbering 
scheme.  Very  briefly,  a  systematic  way  may  be  used  to  color  the  nodes  in  GA  such  that 
any  two  adjacent  nodes  have  different  colors.  Then,  consecutive  numbers  are  assigned  to 
the  nodes  that  have  the  same  color.  If  nc  is  the  number  of  nodes  assigned  the  color  c  and 
ti  min  =  minln,.  }.  then  the  zero  stretch  resulting  from  the  multicolor  numbering  is  approxi- 


mately  equal  to  nmin. 


Multicolor  numbering  schemes  may  be  generalized  and  applied  to  pierced  rectangular 
grids  [14.  15]  and  to  irregular  grids.  In  this  paper,  a  multicolor  numbering  of  irregular  tri¬ 
angular  grids  is  considered,  and  its  effect  on  the  zero  stretch  of  the  corresponding  matrices  is 
studied.  Both  theoretical  and  experimental  results  are  presented. 

The  numbering  algorithm  is  described  in  Section  2.  Briefly,  the  basic  step  in  the  algo¬ 
rithm  is  the  construction  of  a  level  structure  similar  to  the  one  used  in  the  Cuthill-McKee 
algorithm  [4],  The  levels  are,  then,  grouped  into  two  groups,  namely  the  odd  numbered 
levels  and  the  even  numbered  levels,  and  the  nodes  in  each  level  are  colored  such  that 
neighboring  nodes  have  different  colors.  Finally,  the  nodes  which  have  the  same  color  in 
the  odd  numbered  levels  are  numbered  consecutively,  and  the  same  is  repeated  for  the  even 
numbered  levels. 

The  efficiency  of  the  algorithm  is  studied  in  Section  3.  and  a  lower  bound  on  the 
resulting  zero  stretch  is  obtained.  This  bound  shows  that  larger  zero  stretches  may  be 
obtained  by  reducing  both  the  number  of  colors,  cr,  needed  to  color  each  level,  and  the 
difference.  a,nax.  between  the  number  of  nodes  in  the  odd  numbered  levels  and  the  number 
of  nodes  in  the  even  numbered  levels. 

The  theoretical  lower  bound  on  the  zero  stretch  motivates  two  modifications  of  the 
basic  algorithm.  The  first  modification,  which  is  described  in  Section  4.  aims  at  the  reduc¬ 
tion  of  amin.  and  the  second,  which  is  described  in  Section  5,  aims  at  the  reduction  of  tr. 
Finally,  in  Section  6.  the  results  of  running  the  algorithm  on  three  irregular  meshes  are 


presented  and  discussed. 
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2.  DESCRIPTION  OF  THE  ALGORITHM 

The  class  of  meshes  considered  in  this  study  consists  of  irregular  meshes  of  triangular 
elements,  where  each  element  contains  three  nodes  located  at  the  corners  of  the  triangle.  If 
a  mesh  in  the  above  class  is  used  to  discretize  a  PDE.  then  the  resulting  matrix  A 
corresponds  to  a  graph  GA  which  has  the  same  topology  as  the  mesh  itself.  For  this  reason, 
we  will  not  differentiate,  from  now  on,  between  the  mesh  and  the  graph  . 

Given  a  graph  GA  in  which  nodes  are  identified  by  the  integers  1.  •  ji ,  the  problem 

is  to  find  the  numbering  function,  v{\ ,n  ]  -*  [l.n  ].  which  maximizes  the  following 

A  =  min{  I  v(i  )—v(  j  )  I  ;  i  and  j  are  neighbors  in  GA  } 

Unfortunately,  the  number  of  possible  numbering  functions  is  n!  (factorial  n).  and 
the  specific  numbering  that  maximizes  A  may  not  be  found  without  trying  (in  the  worst 
case)  all  of  the  n  !  numberings.  In  the  remaining  of  this  section,  we  will  describe  an  algo¬ 
rithm  which  executes  in  linear  time  and  finds  a  numbering  that  produces  a  large  A.  The 
basic  idea  of  the  algorithm  is  the  inclusion  of  each  node  of  GA  into  one  of  t  lists,  namely 
L1.  •  •  ■  ,L‘ .  such  that: 

1)  Any  two  neighboring  nodes  in  GA  are  not  in  the  same  list.  That  is.  any  given  list 
contains  independent  nodes,  where  two  nodes  are  called  independent  if  they  are  not 
neighbors. 

2)  Let  <f>  be  the  smallest  constant  such  that  if  the  k'h  node  in  some  list  and  the  l'h 
node  in  another  list  are  neighbors  in  GA  ,  then  I  l  —  k  1^0.  The  order  of  the  nodes 
in  each  list  should  lead  to  a  small  0. 

Given  these  lists,  the  nodes  in  the  first  list  may  be  numbered  consecutively,  followed 
by  the  nodes  in  the  second  list,  then  the  third  ,  and  so  on.  If  the  number  of  nodes  in  list  L‘ 
is  denoted  by  I  L  1  I  .  then,  the  above  scheme  gi\ 


1 


! 

< 


A  ^  min  {  I  L '  1 1  —  0 


(1) 


Assuming  that  (f>  «  mini  I  L‘  I }.  then,  larger  values  of  A  may  be  obtained  by  maxim¬ 
izing  the  size  of  the  shortest  list.  That  is,  by  using  the  minimum  number  of  lists  and  mak¬ 
ing  the  lists  approximately  equal  in  size.  These  were  the  major  factors  that  guided  the 
design  of  our  algorithm. 

The  first  step  toward  the  construction  of  the  lists  Ll,  ■  ■  ■  ,L'  is  the  generation  of  a 
level  structure  (Vj.  •  •  •  ,Vm  }  that  covers  GA  .  This  structure  is  similar  to  the  one  used  in  the 
Cuthill-Mckee  algorithm  [4].  More  specifically,  the  generation  starts  from  an  initial  level 
V i  and  proceeds  inductively  such  that,  given  any  level  Vu .  the  next  level  Vu+1  is  generated 
by  including  in  it  any  node  which  is  a  neighbor  to  a  node  in  Vu  and  which  have  not  been 
included  in  any  of  the  previous  levels  V 1.  ■  ■  •  ,VU .  The  number  of  nodes  in  is  denoted 
by  I  Vu  I .  Note  that,  by  construction,  neighboring  nodes  may  lie  only  on  the  same  level  or 
on  adjacent  levels.  That  is.  a  node  in  a  given  level  Vu  is  independent  of  any  node  in  levels 
Vw  ,-w  >u  +1  or  w  <u  — 1. 

The  nodes  in  each  level  Vu  are  then  partitioned  into  the  minimum  number  of  sets 
V*,V*,  ■  •  •  ,  each  containing  only  independent  nodes.  In  other  words,  no  two  nodes  in  the 
same  set  VJ  may  be  neighbors.  In  Section  3.  it  is  proved  that  three  sets  are  always  sufficient 
to  accomplish  this  partition  for  any  level.  Hence,  if  the  number  of  sets  required  to  partition 
level  Vu  is  denoted  by  su  ,  then.  su  ^3.  Let  cr  =  max  }.  and  for  any  level  u  with 

su  <  cr.  define  the  sets  VJ .  i  -su  +1.  ■  •  •  ,<r  to  be  empty. 

Now.  it  can  be  readily  seen  that,  by  picking  one  set  from  each  alternate  level,  a  list  of 
independent  nodes  may  be  compiled.  For  example,  all  the  nodes  in  V  /  ,V3'  ,  V'j1  .  •  •  are 
independent,  and  hence,  may  be  included  in  one  list.  Using  this  approach,  all  the  nodes  of 
Ga  may  be  distributed  into  2<r  lists  of  independent  nodes;  a  of  these  lists  may  be  con¬ 
structed  from  the  odd  numbered  levels,  and  the  other  cr  from  the  even  numbered  levels.  In 
Fig  1.  a  logical  representation  of  the  level  structure  is  shown,  where  two  sets  of  indepen¬ 
dent  nodes  are  connected  by  an  edge  if  and  only  if  there  exists  two  nodes,  one  in  each  set. 
that  are  neighbors  in  .  A  list  is  highlighted  in  the  figure  by  using  the  same  hashing 
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direction  for  all  of  its  sets. 


HI  L'  %  l1  i‘  =  L*  Vt  l>  ^  *■' 

Fig  1  -  The  partitioning  of  the  level  structure  icr-3) 


However,  the  above  construction  may  produce  lists  that  vary  considerably  in  size, 
with  some  lists  being  very  small.  By  (1),  this  leads  to  a  small  A.  In  fact,  the  largest  A  is 
obtained  if  the  2cr  lists  are  of  equal  size. 

It  is  possible  to  balance  the  size  of  the  lists  generated  from  odd  (or  even)  numbered 

/ 

levels.  For  example,  assume  that  the  lists  L1,  ■  ■  ■  .La  are  generated  by  picking  sets  from 
the  odd  numbered  levels.  If  after  picking  sets  from  V  vV 3.  ■  ■  ■  ,VU  _2.  the  lists  are  ordered 
(and  renamed)  such  that  their  sizes  satisfy  \Lk  I  ^  \Lk*1 1 ,  for  k  =1.  •  •  ,cr— 1.  then,  the 
sizes  of  the  lists  may  be  balanced  by  first  renaming  the  sets  Vj,  •  •  •  ,Vua  such  that 
I  V*  I  ^  I  V*  +l  I .  and  then  adding  Vk  into  Lk  for  k  =1.  •  •  ,<r.  If  this  process  is  applied  at 
all  the  odd  levels,  u  =1,3,  •  •  ■  .  then,  it  is  straight  forward  to  prove  the  following  using  a 
recursive  argument: 

I  I L'  I  —  \Lj  I  I  ^  max)  I  V*  I  :  u  = odd  .  and  k  =1,  •  •  •  ,<r)  i  .j  ^  <x  (2.a) 

where  the  outer  most  vertical  bars  in  the  left  side  of  the  inequality  are  used  to  indicate  the 
absolute  value  of  a  signed  integer.  Similarly,  if  the  lists  Z,0"*'1,  •  ■  ,L2a  are  generated  from 
the  even  numbered  levels,  then 
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I  \Ll  I  —  I  Li  I  I  ^  maxj  I V*  I  ;  u  —even  ,  and  k  =1.  •  ••  ,<r}  i  ,j  ^  cr+1  (2.b) 

With  the  above  strategy,  the  size  of  the  lists  Ll ,  ■  ■  ■  ,LC .  as  well  as  the  size  of  the  lists 
L <r+1.  •  •  •  X2<r  may  be  balanced.  However,  the  size  of  a  list  L‘  ,i  ^cr  relative  to  another  list 
L*  .  j  > a.  depends  on  the  total  number  of  nodes  in  the  odd  and  even  numbered  levels. 
That  is,  I  L‘  I  ==  I  L 1  I  only  if  =odd  I  Vu  I  =  £a  _,VCT  I  Va  I .  For  the  examples  considered 
in  Section  6,  these  sums  were  approximately  equal.  However,  should  there  be  a  big 
difference  between  £,u  -odd  I  Vu  I  and  £,u=tven  I  I .  then  the  3-way  grouping  algorithm 
described  in  Section  4  may  be  used. 

In  brief,  the  multicolor  node  numbering  scheme  may  be  summerized  as  follows: 

NODE  NUMBERING  ALGORITHM 

Initially,  input  the  nodes  in  level  V  j.  and  set  u  =1 

1)  partition  Vj  into  the  sets  V /  ,  •  •  ■  .Vj1  such  that  each  set  contains  independent 
nodes,  and  I  l  ^  I  +1  I .  k  =1,  ■  •  •  ,s  j— 1. 

2)  REPEAT  until  every  node  is  in  some  level 

2.1)  u  —  u  +1 

2.2)  Construct  the  next  level  Vu 

2.3)  Partition  Vu  into  the  sets  Vul.  ■  ■  ■  .V’“  such  that  the  nodes  in  each  set  are 

independent,  and  I  V^'  I  ^  I  V*  +1 1 ,  k  =1,  ■  —1. 

3)  Let  m  be  the  number  of  levels  obtained  in  step  2.  and  set 
tr  =  max{ru  :  u  =1.  ••  •  jn  }. 

4)  FOR  i  =1,  •  ■  •  .2 (7,  let  L'  =  the  empty  list. 

FOR  u  =1,  ■  ■  •  ,m  ,  with  an  increment  of  2,  DO 

4.1)  Find  two  permutations:  n  on  [l.cr]  and  p  on  [cr+1 ,2cr],  such  that 

IL1r<*)|  ^  IL’ra+1)l.  k  =1.  •  ■  ■  .cr-1. 

I  L pa  )  I  ^  \Lp(k+l)\.  k  =cr+l.  ,2<t—1. 


4.2)  FOR  k  *1.  ■  •  •  ,o\  append  the  nodes  in  V*  to  the  list  L  \ 

4.3)  FOR  k  =cr+l,  •  ■  •  .2<r,  append  the  nodes  in  V£Z\  to  the  list  Lp(k  \ 

5)  A=0 

FOR  i  =1.  •  •  •  ,2cr  DO 

FOR  each  node  e  in  the  list  L  ‘  DO 
A=A+1  :  v(e  )  =  A 

It  is  easy  to  see  that  the  execution  time  of  the  above  algorithm  is  proportional  to  the 
number  of  edges  in  GA  .  More  specifically,  the  most  time  consuming  part  in  the  algorithm  is 
the  construction  of  the  level  structure,  which  depends  on  the  number  of  edges  in  the  graph. 
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k  =1 

II 

to 

u  =1 

1 

2 

u  =2 

3.6.8 

4.7 

u  =3 

5.9 

10 

u  =4 

11.13 

12 

u  =5 

18.14.17 

21.16 

Ll  =  (1.18.14.17) 

L2  =  (2.10.21.16.26) 
L 3  =  (5.9.15.23) 

L 4  =  (3.6.8.22) 

L5  =  (4.7.12.24.19) 
L*  =  (11.13.25.20) 


(a)  an  arbitrary  numbering 


(b)  the  construction  of  the  lists 


(c)  The  multicolor  numbering 
Fig  2  -  The  numbering  of  an  irregular  mesh. 
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In  Fig  2,  the  algorithm  is  applied  to  a  simple  mesh  which  consists  of  26  nodes.  The 
levels  are  depicted  by  bold  lines,  and  the  sets  of  independent  nodes  within  each  level,  as 
well  as  the  lists  of  independent  nodes  are  identified  in  Fig  2(b)  in  terms  of  the  arbitrary 
numbering  used  in  Fig  2(a).  The  numbering  resulting  from  applying  the  algorithm  is 
shown  in  Fig  2(c).  from  which  it  may  be  seen  that  A  =  3.  If  {24,25.26}  is  chosen  as  a  start¬ 
ing  level,  or  if  the  backtracking  algorithm  described  in  Section  4  is  applied,  then  the  value 
of  cr  may  be  reduced  from  3  to  2.  and  A  =  4  is  obtained.  For  this  small  problem,  it  was 
possible  to  find  the  numbering  which  maximizes  A  by  doing  an  exhaustive  search  which 
took  382  hours  to  execute  on  a  SUN 3  workstation  (a  problem  with  28  nodes  would  need 
27x28x382  hours).  The  largest  A  was  found  to  be  equal  to  7  for  this  mesh. 

In  the  next  section,  a  lower  bound  on  the  value  of  the  zero  stretch,  A,  resulting  from 
the  algorithm  is  established- 
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3.  ANALYSIS  OF  THE  ALGORITHM 

The  problem  of  partitioning  the  nodes  in  a  level  into  sets  of  independent  nodes  is 
equivalent  to  the  problem  of  coloring  the  nodes  in  the  level  such  that  any  two  neighboring 
nodes  have  different  colors.  Given  such  a  coloring,  the  sets  may,  then,  be  formed  by  includ¬ 
ing  nodes  with  the  same  color  in  the  same  set.  Hence,  from  now  on.  if  a  minimum  of  s 
independent  sets  are  required  to  partition  a  specific  level,  then  we  will  say  that  this  level  is 
s  -colorable. 

In  this  section  we  analyze  the  performance  ot  the  algorithm  and  obtain  a  lower  bound 
on  A  produced  by  the  algorithm.  We  need  to  digress  slightly  in  doing  so  and  study  certain 
properties  of  planar  and  outerplanar  graphs.  Specifically,  we  proceed  as  follows  :  first  we 
observe  that  GA  is  a  planar  graph;  next  we  show  that  when  the  algorithm  is  executed  on  a 
planar  graph,  if  the  initial  level  taken  by  the  algorithm  forms  an  outerplanar  graph,  then 
each  of  the  subsequent  levels  also  forms  an  outerplanar  graph;  then,  we  show  that  any 
outerplanar  graph  can  be  colored  using  no  more  than  three  colors;  finally  this  result  is  used 
to  obtain  lower  and  upper  bounds  on  the  performance  of  the  algorithm. 

3.1.  Outerplanar  graphs 

Formally.  [2.  3.  20],  an  outerplanar  graph  is  defined  as  follows: 

Definition:  A  graph  is  said  to  be  planar  if  there  exists  a  mapping  of  its  vertices  and  edges 
into  the  plane  such  that  a)  each  vertex  v  is  mapped  into  a  distinct  point  v'  in  the  plane,  b) 
each  edge  (v.w  )  is  mapped  onto  a  simple  curve  having  end  points  (v’  ,w‘  ).  so  that  c)  the 
mappings  of  the  edges  meet  only  at  common  end  points.  A  planar  graph  is  said  to  be  outer¬ 
planar  if  it  can  be  embedded  in  the  plane  such  that  all  vertices  lie  on  some  contour  and  all 
the  edges  lie  inside  this  contour  (see  Fig  3(a)). [] 

Definition:  A  subdivision  of  a  graph  G  is  a  graph  G' .  obtained  from  G  by  replacing  some 
edge  ( u  ,v  )  by  a  new  vertex  w  together  with  the  edges  (u  .w  )  and  (w  ,v  ).  A  graph  H  is 
homeomorphic  from  a  graph  G  if  //  can  be  obtained  from  G  by  a  finite  sequence  of  subdi 


-  12  - 


<t> 

(a)  An  outerplanar  graph  (b)  Airgraph  (c)  A  ^2,3  graph 

Fig  3  -  Some  planer  graphs 

visions.  [] 

The  following  result  is  known  previously  [2]: 

Theorem:  Let  K4  denote  the  complete  graph  on  4  vertices  and  AT2.3  the  complete  bipartite 
graph  on  5  vertices  (see  Fig  3).  Then  a  graph  G  is  outerplanar  if  and  only  if  G  has  no  sub¬ 
graph  homeomorphic  from  K  4  or  K 2.3.  [] 

3.2.  An  upper  bound  on  <7 

In  order  to  determine  the  maximum  number  of  colors  needed  to  color  the  nodes  in  any 
particular  level,  we  start  by  proving  the  following: 

Proposition:  Let  G  =(V  £ )  be  a  planar  graph  and  let  S  =(V'  JT )  be  a  connected  subgraph 
of  G .  If  S=Vk  for  some  k  during  the  execution  of  the  node  numbering  algorithm  of  Sec¬ 
tion  2.  then  Vk  +1  is  outerplanar. 

Proof:  Assume,  for  contradiction,  that  Vk  is  not  outerplanar.  Thus  it  must  have  a  sub¬ 
graph  homeomorphic  from  K4  or  A2.3.  Since  we  are  investigating  meshes  with  triangular 
elements  only,  we  observe  that  if  Ga  contains  a  subdivision  of  K 4  or  A' 2.3.  then  it  contains 
K4  or  K2,3  respectively.  Assume  that  K4  is  a  subgraph  of  1^+,  and  number  its  vertices 
1.2. 3.4  arbitrarily.  Since  S  is  connected  and  GA  is  planar,  S  must  be  entirely  contained 
either  in  one  of  the  internal  regions  (1.4.3).  (1,2.4).  and  (2,3.4)  or  must  be  outside  the  K4 
(that  is.  no  vertex  of  S  is  in  any  of  these  regions)  in  any  planar  embedding  of  GA  •  It  is  clear 


./  ■/  V'  •.*  %■«-. 

*  m  *  M  *  m  *  _ m 
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that  whatever  be  the  case,  at  least  one  edge  connecting  a  vertex  of  S  to  one  of  the  vertices 
of  K 4  must  necessarily  cross  one  of  the  edges  of  K 4.  Similarly  we  can  show  in  the  case  of 
K  2  3  that  at  least  one  vertex  of  K 2  3  cannot  be  connected  to  any  vertex  of  S  without  cross¬ 
ing  an  edge  in  any  planar  embedding  of  GA  .[] 

In  the  statement  of  the  above  proposition,  it  is  necessary  that  S  be  connected.  Counter 
examples  can  be  easily  constructed  with  S  spread  across  several  regions  of  K 4  or  K22  while 
still  guaranteeing  the  planarity  of  GA  .  The  proposition  implies  that  if  the  initial  level 
chosen  by  the  algorithm  is  connected,  then  each  of  the  levels  generated  later  is  an  outer- 
planar  graph.  This  is  true  even  if  the  levels  other  than  the  initial  level  are  not  connected 
since  the  planarity  of  GA  ensures  that  no  level  generated  by  the  algorithm  is  spread  across 
the  regions  of  a  K4  or  #2.3.  Mote  that  the  connectedness  of  the  initial  level  is  a  sufficient 
and  not  a  necessary  condition.  In  other  words,  it  is  possible,  for  some  meshes  to  start  with 
an  initial  level  that  is  not  connected  and  yet  generate  only  outerplanar  graphs  as  the  subse¬ 
quent  levels. 

The  following  result  concerning  the  3-colorabilitv  of  outerplanar  graphs  was  known 
previously  [2.  3.  20],  However,  we  have  not  been  able  to  find  an  explicit  algorithm  in  the 
literature  which  actually  provides  the  coloring.  We  present  such  an  algorithm  here,  which 
is  actually  programmed  as  part  of  the  experiments  reported  in  Section  6. 

Theorem:  Any  outerplanar  graph  is  3-colorable. 

Proof:  We  consider  outerplanar  graphs  which  do  not  have  isolated  nodes.  The  coloring  of 
isolated  nodes  is  trivial  becauser  they  do  not  have  any  neighbor.  Given  an  outerplanar 
graph  G  with  N  nodes,  we  first  line  up  its  nodes  on  a  straight  line  such  that  all  the  edges 
are  on  one  side  of  the  line.  This  is  always  possible  by  chosing  any  two  adjacent  nodes  on 
the  contour  of  the  planer  embedding  of  the  outerplanar  graph,  and  considering  these  two 
nodes  to  be  the  first  and  the  last  node  in  the  node  lineup.  All  other  nodes  may  be  lined  up  in 
the  order  of  their  appearance  on  the  contour  (see  Fig  4).  In  order  to  identify  the  nodes,  we 
number  them  sequentially  starting  at  one  of  the  end  nodes. 


Fig  4  -  A  different  embedding  (a  line  up)  of  the  graph  of  Fig  3(a) 

For  any  two  neighboring  nodes  with  numbers  v  and  w  (say  v  <  w  ),  we  define  the 
region  r  (v  ,w  )  to  be  the  set  of  nodes  with  numbers  between  v  and  w  .  including  v  and  w  . 
Note  that  if  w  =v  +1,  then  r  (v  ,w  )  consists  of  only  w  and  v  .  The  following  statement  fol¬ 
lows  from  the  definition  of  outerplanarity:  if  a  vertex  u  .  different  from  v  and  w  .  is  in 
r  (v  ,w  ).  then  any  region  r  ( u  ,e  ).  which  has  u  as  one  of  its  end  nodes,  should  be  properly 
contained  in  r  (v  ,w  ).  The  nesting  level  of  a  region  is  the  number  of  regions  which  properly 
contain  that  region.  For  example,  a  region  r  (v  ,w  )  is  called  O-nested  if  it  is  not  contained  in 
any  other  region.  That  is.  if  there  do  no  exist  two  nodes  e  j  <  v  and  e2  ^  w  .  such  that  e  j 
and  e2  are  neighbors,  and  at  most  one  of  e  t  or  e2  is  equal  to  v  or  w  .  respectively. 

The  coloring  strategy  is  recursive  with  respect  to  the  nesting  level.  More  specifically, 
assume  that  r(ui.Wi).  r(u2.w2).  •  •  •  .  r  (ut  ,wt ).  k  ^  1.  are  the  O-nested  regions  in  G  . 
Noting  that  u  j=l  and  wk  =N  .  we  define  the  set  R0(l.N  )  to  contain  all  the  end  nodes  of  the 
O-nested  regions  between  nodes  1  and  N  .  That  is  0C 1  )  =  [u,  .w,  I  i  =1 . k  }.  Clearly. 


for  any  two  consecutive  regions,  either  v,  =  or  v,  =  w,  +1  —  1.  In  the  former  case, 
node  V;  has  two  neighbors  among  the  nodes  in  ^0(l  JV ),  and  in  the  latter  case,  v,  has  only 
one  neighbor  among  the  nodes  in  R0(l.N ).  This  argument  applies  to  any  node  in  Roil.N  ). 


and  hence,  it  is  possible  to  color  all  the  nodes  in  R0(1.N  )  using  only  three  colors  (call  these 
colors  c  1.  c  2  and  c  3).  In  fact,  this  last  statement  is  true  even  if  the  colors  of  the  first  and 
last  nodes,  namely  1  and  N  ,  are  bound  a  priori  to  two  of  the  colors  c  1 .  c  2  and  c  3. 


After  coloring  the  nodes  in  R0(l,N).  we  consider  the  O-nested  regions  consecutively. 
Each  O-nested  region  r(ut  ,w,  )  is  either  a  trivial  region  which  contains  only  nodes  u  and 
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w, .  or  it  contains  at  least  two  1-nested  regions  r  (u,  j.w,  j).  •  .  r(ulf.wlfi).  where. 

u,  x-ut  ,  =w,  and  p  ^2.  Given  that  u,  and  wt  are  already  colored  with  two  different 

colors  from  {cl.c2.c3),  then  nothing  has  to  be  done  in  the  former  case.  If.  however,  the 
latter  case  applies,  then  we  consider  the  set  R  jfu,  .w,  )  which  contains  the  end  nodes  u ,  and 

Wjj  .  j—  1 . p  .  of  the  1-nested  regions  between  u,  and  w;  .  Following  an  argument  similar 

to  the  one  used  for  R0(i.N  ).  it  may  be  shown  that  the  nodes  in  R \(u,  .w  )  may  be  colored 
using  only  c  1,  c  2  and  c  3. 

After  coloring  the  nodes  in  #i(u,  ,w, ),  we  repeat  the  above  process  with  the  2-nested 

regions  contained  in  each  1-nested  region  r  (u,  j  ,w,  j  ).  i  =1 . p  .  When  the  recursive  process 

terminates,  all  the  nodes  in  G  will  be  colored.  Termination  is  guaranteed  because  every 
region  in  G  has  a  finite  nesting  level.  [] 

Corollary;  Each  of  the  levels  generated  by  the  node  numbering  algorithm  of  Section  2.  is 
3-colorable.  [] 

3.3.  A  lower  bound  on  A 

In  this  section.  Equation  (1)  is  used  to  derive  <  lower  bound  on  A  in  terms  of  the  total 
number  of  nodes  n  in  the  graph  G^  .  First,  some  terminology  is  introduced:  Let 
S,ytn  u  =  {V,  ;  i  =even  and  i  ^u)  be  the  set  that  contains  the  even  numbered  levels,  up  to 
level  u .  and  let  nfvtn  u  be  the  total  number  of  nodes  in  these  levels,  that  is 
n,ien  M  =  Z 1  v,  1  ■  Let  S ^  u  =  {V,  :  i  =odd  and  i  )  and  nljdd  „  be  defined  in  a 
similar  way.  Let  also  au  be  the  difference  1  „  —  nrHiJ  „  .  and  denote  the  maximum  of 

<xu  for  u  =1,  •  ,m  by  amiX. 

The  lists  L '  .  i  =  1 2 cr  are  constructed  in  the  algorithm  of  Section  2  incrementally  by 

considering  the  leve.s  1.2.  in  order  Let  L'  be  the  initial  part  of  l.'  which  contains 
those  nodes  that  are  in  levels  1.  ,u  .  and  let  l,‘  be  the  number  of  nodes  in  Clearly. 
l,'„  -  I  /- '  I .  Finally,  let  be  the  maximum  number  of  nodes  in  a  level,  and  vj1)ax  be  the 

maximum  number  of  nodes  in  a  set  of  independent  nodes  in  any  level  More  precisely. 


KBPS 


v m»x  —  max{  I  V,  I  :  i  -1,  •  •  ■  ,m  }  and  v^,4X  =  max!  I  V*  I  ;  i  =1 m  and  k  =1 cr}. 

With  this  notation,  equs  (2)  may  be  written  in  a  more  general  form.  More 
specifically,  for  any  u  . 

1 K  ~  1  ^  v^ax  IF  (i.j  ^  cr)  OR  (i  ,j  >  cr)  (3) 


<r  2<t 


But.  £  1‘  -  M  .  and 

I 

ij 

*  ntven  M  ■  This,  together  with  (3)  gives 

i  =i 

i  =<r+l 

nodd  M  (tT  vmax 

si; 

< 

^odd  ,u  "h  v  max 

if  i  ^  or 

(4.a) 

cr 

N 

cr 

nevm  ju  (CT— 1 )  V  max 

ZIJ 

< 

^ even  ju  ^  ^max 

if  i  >  cr 

(4.b) 

cr 

s 

cr 

The  above  two  inequalities,  may  then  be  used  to  compare  the  number  of  nodes  in  two  lists 
L'u  and  LJ  for  i  <cr  and  j  >c r.  Namely. 


|  ^  i  _ ij  |  <  ^  ntvtn  m  nodd  m  ^  2(<T  1 )  v  m 


if  t  ^cr  and  j  ><J  (5) 


From  (3)  and  (5).  the  following  bound  may  be  obtained  for  any  i  and  j  . 
Uj-tfl  ^  ^-f.2(<r~1)v™»  for  i  ,j  ^2cr 


(6) 


In  order  to  apply  (1).  we  first  need  to  estimate  a  lower  bound  on  l^.  Noting  that 
nodd  jm  +  rtrven  /n  ~  ri  •  the  total  number  of  nodes  in  GA  .  and  applying  the  definition  of  amax. 
we  may  obtain. 


n  amax  .  -y  n  Qmax 

*  - ; -  and  n^d*.  2  - - - 


which,  together  with  (4)  gives 


i,  -i  n  amax  2(<T  l)vmax 

J?  - - - 

2  <r 


for  i=l.  -  -  .2(7 


(7) 


Next,  we  need  to  estimate  an  upper  bound  on  the  value  of  <f>  which  appears  in  (1).  For 
this,  we  consider  any  two  nodes  e  j  and  e  2  from  two  different  lists  L  '  and  L  ,  respectively, 
such  that  ei  and  e2  are  neighbors.  Being  neighbors,  the  two  nodes  should  either  be  in  the 


same  level,  or  be  on  two  consecutive  levels.  This  means  that  there  exists  a  level  u  such  that 
e  i  and  e2  are  not  in  L  ‘  and  LJ.  respectively,  but  are  in  Z,u‘+2  and  LJ+2  ■  respectively.  How¬ 
ever..  Lu+2  may  contain  at  most  vj„ix  nodes  which  are  not  in  and  the  same  applies  to 
Li +2  and  Li.  Hence,  from  the  definition  of  <f>  we  get 

I  lu  —  li  I  +V^X 
which  from  (6)  gives 

u.  <  atn*x  (3^  2)  vmtx 

0  ^  -  -  (g) 

Finally,  by  substituting  (7)  and  (8)  into  (1)  and  noting  that  2^a^3  and  that 
vnux^2vmax  (guaranteed  by  the  algorithm  for  coloring  the  levels),  we  may  obtain  the  fol¬ 
lowing  bound  for  A 

^  ^  2<X  T  (9) 

Experimental  results,  including  those  for  the  test  problems  of  Section  6.  show  that 
vm»x  is  of  the  order  of  \/n" .  Hence,  for  large  values  of  n  .  vmlx  is  much  smaller  than  n  . 
Moreover,  if  the  number  of  nodes  in  the  odd  and  even  numbered  levels  are  approximately 
equal,  then  am„  is  also  smaller  than  n  .  and  (9)  may  be  written  in  the  form 

2a  6 

where  A"  is  a  small  number. 


Although,  for  the  meshes  considered  in  Section  6.  amax  is  much  smaller  than  n  .  it 
seems  that,  without  additional  restrictions  on  the  topology  of  the  mesh,  the  value  of  amax 
may  be  of  the  order  of  n  .  For  example,  a  mesh  of  the  type  shown  in  Fig.  5  results  in 

amax  ^  '  w^ere  /  =  —  D/2  —  1  and  d  is  the  maximum  number  of  neighbors 

per  node  {d  —9  in  the  figure).  For  this  type  of  meshes,  the  odd/even  grouping  of  the  levels 
leaves  the  value  of  ctmax  without  any  control.  In  the  next  section,  we  discuss  a  grouping 
approach  which  overcomes  this  problem. 
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4.  A  THREE  LEVEL  GROUPING  MECHANISM 

The  central  concept  in  the  algorithm  of  Section  2  is  to  group  the  levels  in  GA  into  two 
sets  of  levels,  namely  Sfvm  ^  and  S^d  jv  •  such  that  the  levels  in  each  set  are  independent. 
In  this  section,  we  replace  the  odd/even  grouping  of  the  levels  by  another  grouping  which 
tries  to  balance  the  number  of  nodes  in  each  group.  The  flexibility  required  to  maintain  this 
balance  may  be  provided  if  three,  rather  than  two.  groups  are  used  as  explained  thereon. 

Given  the  levels  V ■  ■  ,Vm  .  it  is  required  to  distribute  these  levels  into  three  sets, 
namely  S  i/n  .  S  and  S  3^, .  such  that: 

1)  Any  two  consecutive  levels  Vu  and  Vu+1  should  be  in  two  different  sets.  This 
guarantees  that  the  levels  in  each  set  are  independent,  and 

2)  For  i  ,j  =1.2,3,  nl/n  =  n;  ^  ,  where  nl/n  is  the  total  number  of  nodes  in  the  lev¬ 
els  that  are  contained  in  S,  ^  .  More  precisely,  n,  „  =  ts,  1  I 

The  sets  S,  „  .  i  =1.2.3  may  be  constructed  inductively.  More  specifically,  let  S,  u_i, 
i  =1,2.3  be  the  sets  that  were  constructed  from  the  levels  V and  assume, 
without  loss  of  generality,  that  n  ^  n3u_j.  Then,  in  order  to  keep  the  size  of 

the  sets  balanced,  it  is  natural  to  add  level  Vu  to  the  set  that  contains  the  least  number  of 
nodes.  In  other  words: 

•Si.«  =Si.u_,  U  !  •  $2,u  ~  S 2.u  —i  and  53o  =S3o_i 

Now  if  the  sizes  of  S,  „  _j.  i  =1.2.3  differ  by  at  most  vmax  =  max  (  I  \\  I  ).  that  is  if 

1^4  £  W 

^  2.u  -1  ^  1  .u  —  1  ^  ^  3.^  —1  l.u  —  1  ^  max 

then  it  may  be  shown  that  after  the  addition  of  level  u  ,  the  sizes  of  the  new  sets  may  not 
differ  by  more  than  vmax.  that  is 

-n]M  I  <  vmax  i  .>=1,2.3  (10) 

Unfortunately,  if  the  set  S  [  „  _]  contains  level  \  „  _j.  then  it  is  not  possible  to  add  \  to 
that  set.  because  of  the  dependency  between  Yu  and  In  this  case.  \  max'  be  aiiiled  to 
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the  next  smallest  set,  namely  S2.u-i-  However,  the  resulting  sets  may,  then,  violate  (10). 
More  specifically,  if  n2M- 1  —  D  ^  vmax,  then  it  is  possible  to  prove  the  following 

least  upper  bound: 

n2,u  ^  1  m  ^  v  m»x  D  ^  2v  max 

In  order  to  reduce  the  maximum  difference  in  the  sizes  of  the  sets  back  to  vmax,  let  w 
be  the  smallest  integer  such  that  one  of  the  following  is  true 

I  Vu  +i  I  +  I  Vu  +3  I  +  •  •  •  +  I  Vu  +w  I  £  D  ( Here  w  is  odd  ) 
or 

I  Vu  +2  I  +  I  Vu  +4  I  +  •  •  •  +  I  Vu  +w  I  ^  D  ( Here  w  is  even  ) 

If  w  is  odd.  then  add  Vtt+1,Vu+3  ■  •  •  ,Vu+v  to  5 Xm  and  Vu+2yu  +4.  '  '  '  -K  +„  -i  10  •  while 

if  w  is  even,  add  Vu+2,Vu+4  •  •  •  .Vu+W  to  SlM  and  Vu  +1,^+3  •  ■  •  ,VU+W-X  to  Siu  .  With 
this,  it  is  easy  to  show  that  at  level  u  +w  .  the  sizes  of  the  sets  S,  M  +u.  salisf y 

nj  ju+v  I  ^  vmax  i,y— 1,2,3  (11. a) 

and  that  at  any  level  u  +1  between  u  and  u  +w  the  sizes  satisf y 

^  ni  .u  +/  nj  m+1  ^  ^  2vmax  i  ,y -1,2,3  (li  b) 

After  restoring  the  difference  in  the  sizes  of  the  sets  to  vmax,  these  sets  may  be  reor¬ 
dered  (renamed)  such  that  nlu+u.  ^  n2u+)..  ^  n3,u+*  ■  and  the  above  procedure  may  be 
repeated.  More  precisely,  the  algorithm  may  be  described  as  follows: 

/*  n,  u  and  tc  will  be  used  to  denote  the  total  number  of  nodes  in  the  levels  contained 
in  S,  M  and  Tc  .  respectively  */ 

FOR  i  =1.2.3  let  S,  0  ”  the  empty  set 
u  =  0  : 

REPEAT 

u  =  u  +  1  ; 

Sort  and  rename  S,  i  =1.2.3.  such  that  n  , „  2„ -1  , u _j  : 

IF  (Vu  t  S,  u  _,)  THEN 


M 


V 

V 

V 

V 


i 


'A 


ri 


VnVvV 


Su  ~  ^1^-1  U  {Vu  }  ;  S2,u  -  5  2m-1  •  S  3m  ~  $3m-1  ■ 


ELSE  { 


Let  D  =  niM -i  -nlM_ ,.. 


Let  w  =0,  c  =1  and  rc  =  T  1-c  =  the  empty  set. 


WHILE  (t,  ^  D  and  u  +w  ^  m  )  DO 


{  w  =  w  +1 


c  =  1— c 


/*  switch  between  T 0  and  T  i  */ 


re  =rc  (J  <va+j  I 

$2m  +<*■  =  ^2*  -lUl^J  : 

SlM+w  =^U-iU^  ■  S 3m  +w  =^3*-lU^l-(  • 


U  =  U  +  W  } 


UNTIL  u  ^  m  (the  number  of  levels) 


The  performance  of  the  algorithm  is  established  in  by  the  following  proposition  which 


may  be  proved  from  (10)  and  (ll): 


Proposition:  At  any  specific  stage  u  of  the  above  algorithm,  the  number  of  nodes  in  the  sets 


S,  M  .  i  -1.2,3,  may  differ  by  at  most  2  vmjx.  where  vmix  is  the  size  of  the  largest  level. 


Given  that  the  levels  may  be  partitioned  into  three  sets  of  independent  levels,  and  that 


each  level  may  be  partitioned  into  <7  sets  (o' ^ 3)  of  independent  nodes,  then,  it  is  possible  to 


modify  the  algorithm  of  Section  2  such  that  3(7  lists  of  independent  nodes,  rather  than  2<7 


lists,  are  constructed  in  step  4.  The  advantage  gained  from  increasing  the  number  of  lists  is 


the  one  of  having  lists  of  approximately  equal  sizes.  More  precisely,  following  the  same 


reasoning  as  in  Section  3,  it  is  possible  to  show  that.  nl/r,  ^  (n  —  4vmix)/3  and  that 


Iri,  ^  (n  —  4vmJX  —  3((7— l)vjnax  )/3<r.  This  leads  to  the  following  bound  on  A 


A  £  -3.17vmax 


which  is  better  than  (9)  if  <*m,.  >  In  /9 cr  +  2.12v, 


.  1 .  ^  * 
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5.  ENHANCEMENT  OF  PERFORMANCE  USING  BACKTRACKING 

Both  bounds  (9)  and  (12)  depend  on  the  number  of  colors,  cr  which  are  needed  to 
color  the  nodes  of  the  levels  in  Ga  ■  Specifically,  larger  values  of  A  are  obtained  for  smaller 
values  of  cr.  Although,  it  was  proved  that  cr  <  3.  experimental  results  showed  that  very 
frequently,  two  colors  are  enough  for  the  partition  of  each  level,  and  that  in  only  few 
instances,  the  third  color  is  needed.  The  need  for  the  third  color  results  whenever  a  level 
contains  three  nodes  that  are  mutual  neighbors  (see  Fig  6(a)).  Such  nodes  clearly  form  a 


(a)  before  backtracking  (b)  after  backtracking 


Fig  6  -  Example  of  the  one-level  look-back  scheme 

By  a  slight  modification  in  the  level  structure  of  Ga  .  it  is  sometimes  possible  to  elim¬ 
inate  the  need  for  a  third  color.  More  specifically,  assume  that  V,.  •  ■  ■  .VH_ ,  are  2-colorable 
and  that  three  nodes  e  and  e3  form  a  cycle  in  level  Vu  .  If  one  of  these  nodes,  for  exam¬ 
ple  ev  has  only  one  neighbor  in  the  previous  level  V„_i.  then  the  cycle  involving  the  three 
nodes  may  be  broken  by  moving  ex  from  level  Vu  to  level  !/„_!  (see  Fig  6(b)).  The  new 
level  V'u-i  ~  Vu  (J  [e  i)  is  2-colorable  because  Vu_1  is  2-colorable  and  ej  is  connected  to 
only  one  node  in  Vu  The  following  levels,  namely  V'w  .  w  .  may  then  be  constructed 
from  V'u  in  the  usual  way. 

The  new  level  V'u  contains,  in  addition  to  all  the  nodes  in  l’„  —  !ej!.  the  neighboring 
nodes  of  ej  which  are  not  in  Vu_1  or  Vu .  Hence,  it  is  possible  for  V\  to  contain  a  cycle. 
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and  thus  be  3-colorable.  If  this  is  the  case,  then  the  above  backtracking  scheme  may  be 
repeated.  Note  that  repeated  backtracking  may  not  le*d  to  oscillation  because  in  each 
instance  of  backtracking  a  new  node  is  added  to  level  u  —1.  This  may  continue  until,  either 
level  u  is  2-colorable  or  until  a  cycle  is  encountered  for  which  backtracking  is  not  possible. 
In  other  words,  it  is  possible  to  construct  a  level  structure  in  which  each  level  is  2-colorable 
provided  that,  whenever  a  cycle  is  encountered  in  some  level,  one  of  the  nodes  forming  the 
cycle  is  connected  to  only  one  node  in  the  previous  level. 

The  above  one  level  look-back  strategy  may  be  extented  to  a  two  levels  look-back 
scheme.  More  specifically,  assume  that  three  nodes  e, .  i  =1,2,3.  form  a  cycle  at  level  Vu , 
and  that  each  node  C; ,  is  connected  to  at  least  two  nodes  in  Vu_1.  say  e,  ^  ,  k  ^2  (see  Fig 
7(a)).  In  this  situation,  the  one  level  look-back  fails.  However,  if  one  of  the  nodes  e,  ^  is 
connected  to  only  one  node  in  level  Vu,2.  then  we  may  add  this  node  to  Vu_2  and  form  the 
new  levels  V'w  .  w  ^u—2.  (see  Fig  7(b)). 


(a)  before  backtracking  (b)  after  backtracking 

Fig  7  -  Example  of  the  two-level  look-back  scheme 

The  two  level  look-back  scheme  should  be  applied  very  cautiously  because  it  may  lead 
to  execution  time  which  is  exponential  in  the  number  of  nodes.  More  precisely,  assume 
that,  in  order  to  render  Vu  2-colorable,  a  backtrack  to  level  u  —  2  is  needed.  Level  u—  1 
may.  then,  become  3-colorable.  and  backtrack  to  level  u  —  3  may  be  needed.  This  procedure 
may  roll  back  all  the  way  to  level  1.  In  order  to  guard  against  this,  some  limit,  b  miX,  may 


6.  EXPERIMENTAL  RESULTS 


A  program  which  implements  the  basic  numbering  algorithm  described  in  Section  2 
was  written  in  Fortran.  The  program  incorporates  the  backtracking  schemes  of  Section  5. 
and  an  input  variable  "BACK"  allows  for  the  choice  among  0.  1  or  2  level  look-back  back¬ 
tracking.  Another  variable.  "Bmax".  limits  the  number  of  levels  that  may  be  rolled  back  in 
the  case  BACK=2.  In  our  experiments.  Bmax  was  fixed  at  the  value  3.  A  stack  oriented 
approach  is  used  to  implement  the  algorithm  of  Section  3.2  for  coloring  the  nodes  in  a  par¬ 
ticular  level. 

The  implementation  of  the  numbering  algorithm  is  general  and  may  be  applied  to  any 
type  of  meshes.  However,  the  backtracking  facility  (BACK  >  0)  may  only  be  used  for  tri¬ 
angular  meshes.  Here  we  note  that,  for  non-triangular  meshes,  more  than  three  colors  may 
be  needed  to  color  the  nodes  in  each  particular  level. 

In  the  algorithm  of  Section  2,  2  cr  lists  of  nodes  are  formed,  where 

cr  =  max{ju  ;  };  cr  of  these  lists  were  formed  using  the  odd  numbered  levels,  and 

the  other  cr  using  the  even  numbered  levels.  In  implementing  the  algorithm,  a  decision  was 
made  to  distinguish  between  aeven  =  max(j„  ;  u  -even  }  and  cr^  =  max)i„  \  u=odd\. 
With  this.  areve„  +  crodd  lists  of  nodes  may  be  formed.  Clearly,  if  cr,0,  =  c +cr,ven  ,  then, 
crto,  <2  cr.  Although  the  theoretical  bound  (9)  depends  on  the  largest  of  crrxid  and  creven  ,  it 
was  found  that  the  value  of  A  may  increase,  slightly,  due  to  this  optimization. 

The  results  of  running  the  algorithm  on  three  test  problems  generated  from  the  irreg¬ 
ular  triangular  meshes  are  reported  in  Tables  1.2  and  3.  The  test  problems  are  generated 
from  the  three  meshes  M  1.  M  2  and  M  3  shown  in  Fig  8.  M  1.  which  is  extracted  from 
[23],  contains  145  nodes  while  M2  and  M  3,  which  are  extracted  from  [9],  contain  249  and 
449  nodes,  respectively.  For  each  mesh,  a  number  of  initial  levels  were  tried.  An  initial 
level  may  contain  only  one  node  (such  a  level  is  denoted  by  the  letter  ,V  in  Fig  8),  or  it 
may  contain  a  number  of  nodes  (such  a  level  is  depicted  by  a  dotted  bold  line  in  Fig  8  and 
denoted  by  the  letter  E ). 


F  rJ  rv-  ^  r.*  rjr  WTt  vm  Lrvvwvvv^n.  WT  *V\  nr  V.VVIM  W  H,"  *TW  ’nj'^V  i ry  wvrun'rj  wu  u- ;  w-., 
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(b)  The  mesh  M  2 
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(c)  The  mesh  M  3 

Fig  8  -  Examples  of  irregular  meshes 
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In  order  to  evaluate  the  effectiveness  of  the  backtracking  scheme,  each  problem  was 
tested  with  BACK  set  to  0.  1  and  2.  respectively.  For  each  run.  the  total  number  of  lists. 
c lo,  -  O'even  +0'0 dd  •  as  well  as  the  resulting  A  are  reported  in  the  tables.  \ote  that  blank 
entries  are  used  to  indicate  irrelevant  runs.  More  specifically,  if  ( Tlol  =  4  for  some  value  of 
BACK,  then  larger  values  of  BACK  may  not  improve  A. 


Initial 

BACK 

-0 

BACK 

=  1 

Level 

V<o< 

A 

V'ot 

A 

N , 

4 

33 

- 

- 

n2 

4 

33 

- 

- 

4 

32 

- 

- 

E  i 

4 

31 

- 

- 

E  2 

4 

33 

- 

- 

E  3 

4 

30 

- 

- 

e4 

5 

20 

4 

31 

Table  1  -  The  results  for  the  mesh  M  1  (n  =145). 


Initial 

BACK 

-  0 

BACK 

-  1 

BACK 

=  2 

Level 

O'.o. 

A 

<r10t 

A 

O”  101 

A 

N , 

6 

31 

5 

37 

5 

33 

n2 

6 

26 

5 

32 

4 

51 

6 

32 

5 

33 

4 

51 

6 

34 

5 

32 

4 

52 

Hs 

6 

33 

5 

35 

4 

50 

Ex 

6 

32 

5 

35 

4 

51 

E  2 

6 

26 

4 

47 

- 

- 

E  3 

6 

31 

4 

54 

- 

- 

e4 

6 

32 

5 

34 

4 

53 

E  5 

6 

29 

5 

33 

4 

52 

Table  2  -  The  results  for  the  mesh  M  2  (n  =249). 


The  first  observation  to  be  made  about  the  results  concerns  the  frequency  of  success  of 
the  backtracking  scheme.  Among  the  26  runs  reported,  the  scheme  failed  to  obtain  2- 
colorable  levels  only  twice.  Namely  with  M2  starting  at  N  j  and  with  M  3  starting  at  E  4. 
Also,  for  a  specific  problem,  larger  values  of  A  are  obtained  for  smaller  values  of  crw  . 
irrespective  of  the  initial  level.  For  example,  for  M  3.  A  lies  in  the  range  [50.63]  if  c rf0,  =6, 
and  in  the  range  [81,99]  if  cr,0,  =4.  This  indicates  that  the  algorithm  is  not  very  sensitive  to 
the  initial  level  as  long  as  backtracking  is  used  to  minimize  cr,„,  . 


-  28  - 


Initial 

Level 

BACK 

-0 

BACK 

=  1 

BACK 

=  2 

O’roi 

A 

<*,* 

A 

® lot 

A 

N  t 

6 

61 

6 

54 

4 

96 

n2 

6 

63 

6 

57 

4 

99 

n3 

6 

59 

6 

53 

4 

94 

n4 

6 

52 

6 

50 

4 

90 

6 

56 

6 

56 

4 

81 

Ex 

6 

50 

6 

58 

4 

93 

E  2 

6 

60 

6 

52 

4 

91 

E  3 

6 

56 

6 

53 

4 

81 

E  4 

6 

57 

6 

52 

5 

58 

Table  3  -  The  results  for  the  mesh  M  3  (n  =449). 

An  interesting  phenomena  may  be  observed  in  Table  3.  More  specifically,  given  the 
symmetry  of  M  3.  runs  with  starting  levels  El  and  E 2  would  be  expected  to  yield  the  same 
results.  So  are  runs  starting  from  E3  and  E4.  and  runs  starting  from  N  {  and  N  2.  This  is 
not  the  case,  however,  because  some  initial  numbering  has  to  be  used  to  input  the  mesh  into 
the  program,  and  even  though  the  mesh  is  symmetric,  the  initial  numbering  may  not 
guarantee  that  the  nodes  in  symmetric  triangles  are  handled  in  the  same  relative  order.  The 
relative  order  of  nodes  is  particularly  important  when  backtracking  is  used  because,  if  more 
than  one  node  may  be  used  to  break  a  cycle  in  a  level,  then  the  first  one  encountered  is  used. 

The  results  also  show  some  expected  anomalies.  More  specifically,  a  A  obtained  with 
BACK-fe  may  be  worse  than  the  one  obtained  with  BACK*!:  —1.  For  example.  M  3  starting 
from  N  i  produced  A  =  61  and  54  for  BACK  *  0  and  1,  respectively.  This  happens  because 
backtracking  tends  to  produce  larger  levels,  thus  increasing  v  max.  Hence,  if  the  backtrack¬ 
ing  procedure  fails  to  reduce  c r,ot  .  then,  by  (9).  larger  vmax  leads  to  smaller  A. 

Although  finding  the  best  A.  namely  Aap,  is  an  NP-hard  problem,  it  is  easy  to  see  that 
for  any  triangular  mesh  with  n  nodes.  Aop,  ^  y.  In  other  words,  the  best  A  for  M  1.  M  2 

and  Af3  may  not  exceed  48,  83  and  147.  respectively.  Clearly,  our  algorithm  produces 
A  >  0.5Aopr  in  linear  time. 


7.  CONCLUDING  REMARKS 


The  Multicolor  algorithm  given  in  Section  2  is  a  general  numbering  algorithm  which 
may  be  applied  to  any  graph  corresponding  to  an  irregular  mesh.  However,  the  zero  stretch 
resulting  from  the  algorithm  is  inversely  proportional  to  the  number  of  colors  cr  needed  to 
color  each  level  in  the  graph.  For  3-node  triangular  meshes,  it  is  proved  that  cr  $  3.  and  bv 
applying  the  same  technique  used  in  Section  3.  u  is  possible  to  prose  that,  for  4-node  rec¬ 
tangular  meshf  cr  <  4.  Similar  tight  upper  bounds  on  cr  do  not  seem  easy  to  establish  if 
the  nodes  are  not  confined  to  the  corner  of  the  elements  and  if  the  number  of  nodes  in  each 
element  is  not  fixed.  These  types  of  meshes  arise  if  high  order  elements  are  used  or  if  some 
adaptive  mesh  refinement  techniques  are  applied  [22].  Further  research  is  required  to  deal 
with  such  meshes. 

It  has  been  reported  [8.  15.  17]  that,  for  linear  systems  resulting  from  regular  grids, 
the  convergence  rate  of  the  PCCG  method  deteriorates  w  hen  the  zero  stretch  is  increased  by 
renumbering  the  nodes  of  the  grids  (the  renumbering  also  increases  the  bandwidth).  In 
fact,  the  results  in  [15]  suggest  that  the  convergence  rate  is  inversely  proportional  to  the 
zero  stretch  and/or  the  bandwidth  of  the  matrix.  This  effect  also  exists,  in  a  milder  form, 
in  systems  resulting  from  pierced  rectangular  grids  [15],  which  are  slightly  irregular  grids. 
If  future  experiments  show  that  the  same  effect  persists  for  irregular  grids,  then  it  will  be 
interesting  to  study  possible  modifications  to  the  multicolor  algorithm  that  will  maintain  a 
given  zero  stretch  (specified  by  the  architecture  of  a  supercomputer,  for  example)  while 


minimizing  the  bandwidth 
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